HLT-NAACL 2006: Computationally Hard Problems and Joint Inference in Speech and Language Processing

Authors

  • Charles Sutton
  • Jeff Bilmes
  • Fernando Pereira
Abstract

Recent work on ranking, sampling and other approximate solutions to natural language processing problems indicates that researchers are coming back to the hard problems in speech and text, for which efficient algorithms are not known to exist. In addition, there has been increasing interest in moving away from systems that make chains of local decisions independently, and instead toward systems that make multiple decisions jointly using global information. The goal of this workshop is to bring together researchers working on NLP problems whose solutions are computationally hard, whether because the problem is not well modeled by only local features, or because the problem is best solved in a joint, rather than pipelined, manner.

We are grateful to the program committee for providing thoughtful and helpful reviews of the submitted papers. We also thank our invited speakers and the organizers of the main HLT/NAACL 2006 conference, without which this workshop would not be possible.

Abstract: A syntax-directed translator first parses the source-language input into a parse tree, and then recursively converts the tree into a string in the target language. We model this conversion by an extended tree-to-string transducer that has multi-level trees on the source side, which gives our system more expressive power and flexibility. We also define a direct probability model and use a linear-time dynamic programming algorithm to search for the best derivation. The model is then extended to the general log-linear framework in order to rescore with other features like n-gram language models. We devise a simple yet effective algorithm to generate non-duplicate k-best translations for n-gram rescoring. Initial experimental results on English-to-Chinese translation are presented.

Similar resources

Deep Learning and Continuous Representations for Natural Language Processing

Deep learning techniques have demonstrated tremendous success in the speech and language processing community in recent years, establishing new state-of-the-art performance in speech recognition and language modeling, and have shown great potential for many other natural language processing tasks. The focus of this tutorial is to provide an extensive overview on recent deep learning approaches to p...

Low-Dimensional Discriminative Reranking

The accuracy of many natural language processing tasks can be improved by a reranking step, which involves selecting a single output from a list of candidate outputs generated by a baseline system. We propose a novel family of reranking algorithms based on learning separate low-dimensional embeddings of the task’s input and output spaces. This embedding is learned in such a way that prediction ...

Accurate Parsing of the Proposition Bank

We integrate PropBank semantic role labels into an existing statistical parsing model, producing richer output. We show conclusive results on joint learning and inference of syntactic and semantic representations.

Towards Natural Language Understanding of Partial Speech Recognition Results in Dialogue Systems

We investigate natural language understanding of partial speech recognition results to equip a dialogue system with incremental language processing capabilities for more realistic human-computer conversations. We show that relatively high accuracy can be achieved in understanding spontaneous utterances before they are completed.

Publication date: 2006